High-rate data embedding in unvoiced speech

Authors

  • Konrad Hofbauer
  • Gernot Kubin
Abstract

We propose a blind speech watermarking algorithm that allows high-rate embedding of digital side information into speech signals. We exploit the fact that the well-known LPC vocoder works very well for unvoiced speech. A voiced/unvoiced segmentation is carried out using an autocorrelation-based pitch tracking algorithm. In the unvoiced segments, the linear prediction residual is replaced by a data sequence. This substitution does not cause perceptual degradation as long as the residual’s power is matched. The signal is resynthesised using the unmodified LPC filter coefficients. The watermark is decoded by a linear prediction analysis of the received signal, and the information is extracted from the sign of the residual. The watermark is nearly imperceptible and provides a channel capacity of up to 2000 bit/s in a speech signal sampled at 8 kHz.
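The embedding and decoding steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`lpc`, `analysis`, `synthesis`, `embed`, `decode`), the prediction order of 10, the single-frame processing, the one-bit-per-sample mapping, and the coloured-noise stand-in for an unvoiced segment are all assumptions made for illustration. Voiced/unvoiced segmentation, framing, synchronisation, and channel distortion are left out.

```python
import numpy as np

def lpc(x, order):
    """LPC coefficients [1, a1..ap] via the autocorrelation method."""
    r = np.array([np.dot(x[:len(x) - k], x[k:]) for k in range(order + 1)])
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    a = np.linalg.solve(R, -r[1:])  # normal equations R a = -r
    return np.concatenate(([1.0], a))

def analysis(x, a):
    """Prediction residual e = A(z) x with zero initial state."""
    return np.convolve(x, a)[:len(x)]

def synthesis(e, a):
    """All-pole filtering y = e / A(z) with zero initial state."""
    p = len(a) - 1
    y = np.zeros(len(e) + p)  # p leading zeros act as the initial state
    for n in range(len(e)):
        # y[n] = e[n] - sum_k a_k * y[n-k], stored with an offset of p
        y[n + p] = e[n] - np.dot(a[1:], y[n:n + p][::-1])
    return y[p:]

def embed(x, bits, order=10):
    """Replace the residual of x by a power-matched +/-1 data sequence."""
    a = lpc(x, order)
    e = analysis(x, a)
    gain = np.sqrt(np.mean(e ** 2))       # match the residual's power
    symbols = gain * (2.0 * bits - 1.0)   # bit 0 -> -gain, bit 1 -> +gain
    return synthesis(symbols, a)          # LPC coefficients are unmodified

def decode(y, order=10):
    """Blind decoding: re-estimate the LPC filter, read the residual's sign."""
    a = lpc(y, order)
    return (analysis(y, a) > 0).astype(int)

rng = np.random.default_rng(0)
n = 4000
# Coloured noise as a crude stand-in for one unvoiced speech segment (assumption).
x = np.convolve(rng.standard_normal(n), [1.0, 0.6, 0.3])[:n]
bits = rng.integers(0, 2, n)

y = embed(x, bits)
ber = np.mean(decode(y) != bits)
print(f"bit error rate on a clean channel: {ber:.4f}")
```

The decoder never sees the embedder's coefficients; it relies on the re-estimated prediction filter being close enough to the original that the sign of each residual sample survives. With the embedder's own coefficients and zero initial filter state, analysis exactly inverts synthesis up to rounding.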

Similar resources

A Novel Voicing Cut-off Det for Low Bit-Rate Harmonic

Generally, phonetic classification for low rate speech coding is restricted to either a simple binary voiced/unvoiced classification of entire speech frames, or alternatively, a more complicated estimation of the voicing for a set of frequency bands. A good compromise between these two techniques is estimation of a single cut-off frequency that separates the spectrum into voiced (below) and unv...

Structure-based Speech Classification Using Non-linear Embedding Techniques

“Usable speech” refers to those portions of corrupted speech which can be used in determining a reasonable amount of distinguishing features of the speaker. It has previously been shown that the use of only voiced segments of speech improves the usable speech detection system, and also that unvoiced speech does not contribute significant information about the speaker(s) for speaker ide...

High Rate Data Hiding in Speech Signal

One of the main issues with data hiding algorithms is the capacity of data embedding. Most data hiding methods suffer from low capacity, which can make them inappropriate for certain hiding applications. This paper presents a high-capacity data hiding method that uses encryption and the multi-band speech synthesis paradigm. In this method, an encrypted covert message is embedded in the unvoiced b...

A Variable Rate Speech Codec Using Vus Classification

Voiced speech is highly correlated and must be reconstructed accurately in order to sound correct. Unvoiced speech, on the other hand, is noise-like in nature. It can be approximated by white noise coloured by the vocal tract filter. Because of this lack of structure in unvoiced speech sounds, the excitation signal does not have to reproduce the speech signal as accurately as for voiced sounds. T...

Robust Speech Recognition Using a V

In this paper, a voiced-unvoiced measure is used as acoustic feature for continuous speech recognition. The voiced-unvoiced measure was combined with the standard Mel Frequency Cepstral Coefficients (MFCC) using linear discriminant analysis (LDA) to choose the most relevant features. Experiments were performed on the SieTill (German digit strings recorded over telephone line) and on the SPINE (...

Publication date: 2006